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INTRODUCTION 

Programmatics 

This  paper  documents  one  aspect  of  the  ongoing  FY  01  In-house 
Laboratory  Independent  Research  Project  CRANOF  (a  Complexity- 
Reducing  Algorithm  for  Near-Optimal  Fusion),  Project  ZU014, 
with  Principal  Investigator,  Dr.  D.  Bamber,  and  co-investigator, 

Dr.  I.  R.  Goodman  (both  SSC  San  Diego),  and  with  associate  support 
from  Dr.  W.  C.  Torrez  (SSC  San  Diego)  and  Prof.  H.  T.  Nguyen 
(Department  of  Mathematical  Sciences,  New  Mexico  State  University 
and  U.S.  Navy  American  Society  for  Engineering  Education  Fellow 
during  summers  at  SSC  San  Diego).  A  preliminary  version  of  this 
paper  can  be  found  in  [1,  section  3.3]. 

Background  on  Underconslrained  Conditional  Probability  Problems 

Philosophy  of  Approach  and  General  Motivations 

To  improve  the  timeliness  and  accuracy  of  decision-supported  human 
decision-making,  one  is  faced  with  an  array  of  crucial  problems,  including 
how  to  handle  large  amounts  of  incoming  and  uncertain  information  from 
disparate  sources.  These  sources  can  be  human-based  or  mechanical- 
based,  and  the  information  can  arrive  in  different  forms,  such  as  qualita¬ 
tive  and  linguistic,  numerical  and  statistical-probabilistic,  or  some  mixture 
of  both.  At  SSC  San  Diego,  the  CRANOF  project  addresses  such  crucial 
issues  solely  within  the  realm  of  statistics  and  probability.  The  issue  of 
underconstrained  or  underspecified  probabilities  is  treated  by  a  novel 
use  of  second-order  probabilities  (i.e.,  probabilities  of  probabilities)  in 
Bayesian  framework.  Underconstrained  probabilities  arise  in  a  wide  vari¬ 
ety  of  problems,  including  quantitatively  formulated  rule-based  systems, 
tracking  and  correlation,  assessment  of  network  intrusions,  information 
retrieval,  and  simulation  of  human  behavior  in  war  games.  This  paper 
serves  as  a  beginning  extension  of  the  capabilities  of  CRANOF  to  include 
linguistic-based  information. 


ABSTRACT 

This  paper  covers  issues  relating 
to  the  establishment  of  a  sound 
and  conditional  probability- 
compatible  rationale  for  generating 
linguistic-based  inference  rules 
concerning  a  population.  By 
extending  previous  preliminary 
results,  we  detail,  in  a  fully 
rigorous  manner  and  within  the 
confines  of  traditional  probability 
theory,  that  a  comprehensive 
technique  can  be  derived  that  con¬ 
verts  linguistic-based  conditional 
information,  couched  only  in 
fuzzy-logic  terms,  into  naturally 
corresponding  conditional  proba¬ 
bilities.  In  turn,  we  demonstrate 
how  such  typically  undercon¬ 
strained  conditional  probabilities 
can  be  combined  for  suitable 
conclusions  and  decision-making, 
via  a  new  use  of  second-order 
probability  logic.  This  research 
is  part  of  the  ongoing  SSC  San 
Diego  In-house  Laboratory 
Independent  Research  FY  01 
project  CRANOF  (a  Complexity - 
Reducing  Algorithm  for  Near- 
Optimal  Fusion ). 
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Quantitatively  Formulated  Rule-Based  Systems 

Consider  quantitatively  formulated  rule-based  systems,  with  the  rules  or 
conditional  relations  symbolized  typically  as  (aj  |  bj),  (a2  |  b2),...— read  "if 
bj,  then  aj"  (or  equivalently,  "aj,  given  bj,"  etc.),  "if  b2,  then  a2,"...,  where 
events  or  sets  aj,  bj,  a2,  b2,...  may  themselves  represent  quite  complicated 
logical  combinations  of  simpler  events  or  sets,  and  where  it  may  or  may 
not  be  known  what  logical  relations  exist  among  such  events.  Each  such 
rule  is  also  assigned  quantitative  reliability  in  the  form  of  naturally  corre¬ 
sponding  conditional  probabilities.  Thus,  for  some  otherwise  unspecified 
probability  measure  P,  rule  (a|b)  is  assigned  value  P(a|b)  =  P(ab)/P(b),  the 
conditional  probability  of  a  given  b,  using  standard  Boolean  and  probabil¬ 
ity  notation  and  assuming  antecedent  probability  P(b)  >  0.  Because  typical 
rule  (a|b)  is  not  perfect,  in  general  P(a|b)  <  1,  but,  on  the  other  hand,  one 
would  expect  P(a|b)  to  be  reasonably  high.  A  common  problem  that  such 
rule-based  systems  address  is:  Consider  incoming  information  in  the  form 
of  events,  dj,...,  dn,  possibly  gleaned  from  different  sources,  such  as  dj  = 

"visibility  is  up  to  1  mile,"  d2  =  "winds  between  15  mph  and  30  mph,"  d3 
=  "enemy  movement  detected  last  night  in  Sector  C,"...,  dn  =  "political 
situation  with  enemy  country  Q  at  level  R,"  and  a  collection  of  reason¬ 
ably  related  rules,  such  as  (a^bj),  (a2|b2),...,  (am|bm),  where  the  aj,  bj 
involve  not  only  parts  or  all  of  the  dj  (or  various  logical  combinations  of 
them),  but  possibly  other  related  events  (or  logical  combinations  of  such). 

Then,  one  wishes  to  test  for  viability  of  possible  decisions,  based  upon 
this  information,  such  as  c1  =  "fully  successful  attack  by  us  can  be  accom¬ 
plished  by  attacking  in  Sectors  C  or  D,"  c2  =  "partially  successful  attack 
by  us  can  be  accomplished  by  attacking  Sectors  D  or  H,"...  .  Symbolically, 
one  is  considering  the  validity  or  degree  of  validity  of  the  entailment 
schemes  G;  =  [(ajibj),...,  (an|bn);  (cjd)],  i  =  1,  2,... ,  where  d  =  dj&.-.&dn 
(conjunction  of  all  data),  and  where  ((ajlbj),...,  (an|bn))  can  be  considered 
the  premise  set  of  Gj  and  (cjd)  its  potential  conclusion.  Ideally,  one  would 
like  to  know  just  what  each  P(cjd)  would  be,  based  on  having  either,  say, 
the  exact  threshold  situation  holding,  i.e.,  P(aj|bj)  =  tj,  j  =  1,...,  n,  or,  the 
lower  bound  threshold  situation  holding,  i.e.,  having  P(aj|bj)  >  tj,  where  all 
the  thresholds  tj  are  known  or  estimable  in  either  situation.  However,  in 
general,  it  is  readily  demonstrated  that  the  n  equalities  (or  inequalities)  are 
not  enough  to  determine  P  and/or  P(cjd)  completely.  Thus,  one  is  faced 
with  the  problem  of  best  estimating,  in  some  sense,  just  what  P  and/or 
P(cjd)  should  be. 

Adams'  Approach  to  Analyzing  Quantitatively  Formulated  Rule-Based  Systems 

In  a  series  of  papers  [2,  3],  E.  W.  Adams  proposed,  in  effect,  the  estimate 
of  P(cj|d)  to  be  a  pessimistic  one  in  the  form  of  his  "minimum  conclusion" 
function,  using  multivariable  abbreviation  tj  for  (tj)j  jn  j,  (a|b)j  for  (aj|bj)j  ;n  j, 

P(a|b)j  >  tj  for  P(aj|bj)  >  tj,  j  in  J,  lj  for  column  vector  of  all  l’s  indexed 
by  J,  etc., 

estimateHpL  of  (P(cjd)  from  G;) 

=  minconc(G;)(tj)  =  inf{P(cJd):  for  all  possible  probability  measures  P  such  that  P(a|b)j  >  tj},  (1) 
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with  P(c;|d)  for  the  exact  threshold  situation  analogously  estimated.  The 
subscript  (  )Hpl  is  used  to  indicate  "High  Probability  Logic,"  since  Adams 
also  introduced  the  idea  of  an  entailment  scheme  being  HP-valid  or  HP- 
invalid,  which,  in  the  case  of  any  G;  here  simply  means  for  the  former 
that 

G;  is  HPL-valid  iff  limit(minconc(G;)(tj))  =  1. 

(tj’T  ij) 

But,  unfortunately,  both  the  minconc  function  and  its  limiting  forms  to 
test  for  HPL-validity/invalidity  produce  a  number  of  results  very  much  at 
odds  with  commonsense  reasoning,  including  the  fact  that  three  very  fun¬ 
damental  entailment  schemes,  transitivity  (or  hypothetical  syllogism)  [(a|b), 
(b|c);  (a|c)]  (the  heart  of  any  rule-based  system);  contraposition  [(a|b); 
(b'|a')J;  and  strengthening  of  antecedent  [(a|b);  (a|bc)]  are  all  HPL-invalid. 
In  fact,  one  can  find  P’s  that  satisfy  their  premise  thresholds  for  any 
choice  of  tj  close  to  (but  not  exactly  equal  to)  lj,  but  for  which  the  corre¬ 
sponding  conclusion  probabilities  are  arbitrarily  close  to  (or  actually 
equal  to)  0.  Moreover,  more  generally,  Eq.  (2)  can  be  complemented  by 
the  fact  that  any 

G;  is  HPL-invalid  iff  limit(minconc(G;)(tj))  =  0. 

(tjTq) 

Finally,  Adams  pointed  out  another  type  of  validity,  CPL  (Certainty 
Probability  Logic),  that,  although  still  based  on  the  minconc  function,  can 
be  characterized  as  "too  optimistic"  in  contrast  with  HPL,  whereby  the 
criterion  is 

G;  is  CPL-valid  iff  minconc(G;)(lj))  =  1. 

Close  connections  exist  between  CPL  validity/invalidity  (the  latter  satis¬ 
fying  a  relation  analogous  to  that  of  Eq.  (3))  and  that  of  CL  (classical 
logic)  validity  or  invalidity,  noting 

G;  is  CL-valid  iff  &(b'v  ab)j  d'v  c;d. 

(For  further  analysis,  criticism,  and  extension  of  Adams’  ideas,  see  [3].) 

CRANOF  Approach  to  Analyzing  Quantitative  Rule-Based  Systems  and 
Other  Underconstrained  Probability  Problems 

The  previous  conclusions  show  that  the  minconc  function  is  not  a  reason¬ 
able  measure  (for  reasonably  high  thresholds)  of  the  degree  of  validity/ 
invalidity  of  an  entailment  scheme  and  also  show  that  the  HP-validity/ 
invalidity  test  is  too  stringent.  Therefore,  it  seemed  natural  to  replace  the 
extremal  minconc  function  by  the  more  moderating  meanconc  function 
(well-justified  from  decision  analysis  in  the  form  of  conditional  expecta¬ 
tion  and  justified  as  always  admissible,  least-squares  error,  etc.  — see  any 
standard  texts  such  as  Rao  [4]  or  Wilks  [5])  within  a  Bayesian  framework, 
where  the  unknown  probability  measure  P  here  is  treated  as  a  random 
quantity  with  some  appropriately  assigned  prior  distribution,  subject  to 
the  given  premise  set  threshold  constraints.  Utilizing  additional  new  theo¬ 
retical  results  [6],  an  "optimal"  choice  of  prior  or  priors  essentially  must 
come  from  the  well-known  Dirichlet  family  of  distributions.  It  should  be 
noted  that,  unlike  the  minconc  function,  the  meanconc  function  in  the 
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unity-limiting  threshold  case  can  take  on  nontrivial  values  and,  in  a  natu¬ 
ral  sense,  at  any  fixed  threshold  level,  provides  a  reasonable  measure  of 
degree  of  validity  of  that  entailment  scheme  under  consideration.  In  par¬ 
ticular,  in  full  agreement  with  commonsense  reasoning,  transitivity,  con¬ 
traposition,  and  strengthening  of  antecedent  are  all  SOPL-valid,  where 
SOPL  stands  for  Second-Order  Probability  Logic  and  where  one  defines 
validity  of  any  Gj  as 

G;  is  SOPL-valid  iff  limit(meanconc(G;)(tj))  =  1,  (6) 

(tjTij) 

SOPL-validity  depending  on  some  degree,  of  course,  on  the  particular 
choice  of  prior  for  P.  However,  it  has  been  pointed  out  (Bamber  [7]  and 
personal  communications)  that  the  limit  in  Eq.  (4)  remains  the  same  as  if 
the  prior  of  P  is  a  uniform  distributional  one,  when  the  corresponding 
probability  density  function  is  bounded  uniformly  above  and  below 
(from  zero)  over  its  natural  domain  (again,  see  references). 

Also,  see  [8]  for  additional  background  on  both  the  theoretical  structure 
of  the  meanconc  function  and  its  practical  implementational  form 
CRANOF — whereby  a  significant  reduction  in  the  complexity  of  com¬ 
puting  meanconc(G;)(tj)  is  achieved  by,  in  effect,  reducing  the  premise  set 
of  G;  to  a  single  constraint,  also  taking  into  account  the  unity-limiting 
threshold  behavior  of  meanconc  ([7]).  Finally,  Table  1  is  presented  below 
to  illustrate  a  few  typical  evaluations  of  meanconc(G)  for  relatively  simple 
entailment  schemes  G  with  P  assigned  a  uniform  prior  distribution  [8]. 

TABLE  1 .  Abridged  table  of  calculations  of  degree-of-entailment  functions,  minconc  and  meanconc,  for  fixed  threshold  levels,  and  a 
comparison  of  CPL-,  SOPL-,  and  HPL-validities  for  different  types  of  entailment  schemes. 


Name  of 

Entailment  Scheme 

D  =  [(«| %  (c|c?)] 

Given  Levels 
of  Premises: 
¥(a\b\  =  tj, 
for  otherwise 
arbitrary 
prob.  meas.  P 

minconc(D)(tj) 
(inequality 
threshold  form) 

meanconc(D)(tj), 
assuming  uniform 
prior  for  P's  (exact 
threshold  form) 

D  is 
CPL- 
valid  ? 

D  is 

SOPL- 

valid? 

D  is 

HPL- 

valid? 

Cautious 

Monotonicity: 
[(a|b),(c|b);  (a|bc)] 

P(a|b)  =  s, 

P(c|b)  =  t 

>  max(s+t-l,0) 

>  max(s+t-l,0) 

YES 

YES 

YES 

Transitivity: 

[(a|b),  (b|c);  (a|c)] 

P(a|b)  =  s, 

P(b|c)  =  t 

0 

=  st  +  (1— 1)/2  -  p(s,t)/q(s,t), 
p(  s,t)  =  s(l— s)(2s— l)t(l— t2), 
q(s,t)  =  t+2t2+ 

(s(l-s)(l-t)(2+3t-t2) 

YES 

YES 

NO 

Contraposition: 

[(a|b);  (b'|a )] 

P(a|b)  =  t 

0 

j,  ,  (l-t)log(l-t) 
t2 

YES 

YES 

NO 

Positive 

Conjunction: 

[(a|b),(a|c);  (a|bc)] 

P(a|b)  =  t, 

P(a|c)  =  t 

0 

(l+t)/3  +  [((l+t)(2-t)/(3t))  0(t)], 

0(t) 

=  (t2/4)[log((2  t)/t)]/(l— t) 

-  (( 1  t)2/4)*log((l  +t)/(l— t)) 

YES 

YES 

NO 

Nixon  Diamond: 
[(ab|c),(d|a),(d'|b);  (d|c)] 

P(ab|c)  =  s, 
P(d|a)  =  t, 
P(d'|b)  =  t 

0 

1/2 

YES 

NO 

NO 

Abduction: 

[(a|b),  a;  b] 

P(a|b)  =  s, 

P(a)  =  t 

0 

If  s  >  t  :  t/(2s), 

Tr  t3  S(l-t)2 

S<t'  2(t2 - 2st  +  s)2 

NO 

NO 

NO 
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EXTENDING  APPLICABILITY  OF  CRANOF  TO  LINGUISTIC-BASED 
SYSTEMS 

In  considering  linguistic-based  information  in  rule-based  systems  and  in 
formulating  the  linguistic  analogue  of  the  underconstrained  conditional 
(including  unconditional)  probability  problem,  the  role  of  fuzzy  logic 
comes  immediately  to  mind.  This  is  based  in  part  on  the  great  practical 
success  of  fuzzy  logic  in  running  systems  such  as  elevators,  washing 
machines,  etc.,  and  on  the  now  very  large  body  of  scientific  literature  sup¬ 
porting  the  modeling  of  linguistic  information,  relations,  and  decision 
processes  via  fuzzy  logic.  (See,  e.g.,  past  Proceedings  of  IEEE  Interna¬ 
tional  Conferences  on  Fuzzy  Systems  or  the  Proceedings  of  the  Joint 
Conference  on  Information  Sciences,  as  well  as  basic  texts,  such  as  Dubois 
&  Prade’s  now  classic  treatise  [9]  and  Nguyen  &  Walker’s  [10].) 

On  the  other  hand,  there  still  exists  a  lively  controversy  considering  the 
merits  of  using  probability  theory  and  techniques  in  place  of  fuzzy  logic 
and  vice  versa.  (See  Goodman’s  summary  and  listing  of  literature  papers 
directly  involved  in  this  controversy  [11].)  This  leads  to  the  following  area 
in  which  this  author  and  H.  T.  Nguyen  have  played  some  role  over  the 
past  several  years:  the  issue  of  the  possible  direct  connection  between  fuzzy 
logic  and  probability  theory  [12,  13,  and  1].  Until  this  is  completely 
resolved,  it  is  this  author’s  opinion  that  a  comprehensive  view  of  data 
fusion,  which  both  theoretically  and  practically  integrates  linguistic-based 
information  with  probabilistic-based  information,  will  not  be  achieved.  In 
particular,  this  applies  to  rule-based  systems,  where  the  fuzzy  logic  com¬ 
munity  has  developed  a  common  approach  that  is  claimed  to  be  more  sat¬ 
isfactory  than  any  probability  approach. 

This  paper  once  again  points  out  the  existence  of  deep,  but  tractable,  rela¬ 
tions  among  fuzzy  logic,  linguistic-based  principles,  probability  theory, 
and  commonsense  reasoning  mainly  through  the  use  of  two  basic  mathe¬ 
matical  tools:  SOPL/CRANOF  (as  briefly  described  in  the  first  section), 
and  the  representation  theory  of  fuzzy  sets  by  the  one-point  coverages  of 
random  sets  (see  [12,  13])  in  conjunction  with  other  recently  developed 
mathematical  tools  ( conditional  and  relational  event  algebra  [14;  15,  sec¬ 
tion  3]).  In  particular,  homomorphic-like  relations  were  established,  con¬ 
necting  fuzzy-logic  concepts  and  corresponding  random-set  concepts, 
where  each  fuzzy-set  membership  function  involved  is,  in  effect,  inter¬ 
preted  as  the  weakest  way  to  specify  any  of  a  class  of  corresponding  ran¬ 
dom  subsets  of  the  fuzzy  set’s  domain.  These  relations  include  natural 
random-set  interpretations  of  various  combinations  of  fuzzy-logic  opera¬ 
tors  and  Zadeh’s  well-known  "extension  theorem."  This  time,  these  con¬ 
nections  are  extended  to  include  the  formulation  and  use  of  inference 
rules  obtained  from  a  population  of  interest.  The  results  presented  here 
extend  preliminary  efforts  provided  in  Goodman  &  Nguyen  [1],  where  it 
was  demonstrated  that  one  type  of  fuzzy-logic  approach  to  the  modeling 
of  inference  rules  for  a  population,  relative  to  a  given  collection  of  attrib¬ 
utes,  using  the  ratio  of  fuzzy  cardinalities  or  averaged  membership  level  of 
the  attributes,  could  also  be  interpreted  in  a  probability  framework.  In 
addition,  by  using  similar  techniques,  it  is  shown  how  other  fuzzy-logic 
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concepts,  commonly  thought  of  as  not  directly  relating  to  probability, 
may  now  also  be  put  into  a  complete  probabilistic  setting,  including  the 
illustration  for  normalization  of  membership  functions. 

MATHEMATICAL  RESULTS  ESTABLISHING  GENERAL  FUZZY  LOGIC 
POPULATION  CONDITIONING  PROBLEM  AS  AN  UNDERCON¬ 
STRAINED  CONDITIONAL  PROBABILITY  PROBLEM  TREATABLE 
VIA  SOPL/CRANOF 

As  in  the  previous  sections,  standard  Boolean  algebra  and  probability 
theory  notation  will  be  employed,  with  [0,1]  indicating  unit  interval;  {0,1} 
indicating  the  two  element  set  containing  0,  1;  R  indicating  the  real  (or 
Euclidean)  line  and  Rm  indicating  the  real  (or  Euclidean  m-space),  P(D) 
indicating  the  power  class  of  D  (sometimes  written  2° — the  class  of  all 
subsets  of  D),  etc.  "Equal  by  definition"  is  denoted  as  =j.  For  background 
on  copulas,  see  Schweizer  &  Sklar  [16]  and  the  recent  excellent  mono¬ 
graph  by  Nelsen  [17].  Recall  that  copulas  are  any  joint  cdf’s  (cumulative 
probability  distribution  functions),  all  of  whose  one-dimensional  marginal 
cdf’s  correspond  to  identical  uniformly  distributed  rv’s  (random  variables) 
over  [0,1]. 

Theorem  1.  Modification  of  Goodman  [18] 

Let  D  be  a  finite  set,  f,  g:D — >[0,1]  any  two  fuzzy-set  membership  func¬ 
tions,  and  cop:  [0, l]DxD — >[0,1]  any  copula  with  that  domain,  with  (x,y)- 
marginal  copulas  indicated  by,  e.g.,  copx  y,  x,  y  in  D,  etc.  Then: 

(i)  There  is  a  probability  space  (ft,B,P)  and  a  joint  collection  of 
0-1-valued  rv’s,  Zf  x,  Zg)y:ft— >{0,1},  for  all  x,  y  in  D  with  overall  joint  cdf 
^cop  =  c°Po((Ff,x)x  in  d,  (Fg,y)yinD):  RDxD— »[0,1]  (via  Sklar ’s  Theorem 
[16]),  and,  indicating  the  joint  marginal  (x,y)-components  of  cop,  as 
copX;y,  the  joint  cdf  of  (Zf  x,  Zg  y)  is,  correspondingly,  Ef;g,COp,x,y(-,  ••)  = 
coPx,y°(Ff,x(-)!  Fg  y(..)),  where  0  indicates  functional  composition  and  Ef;X; 

Eg  v  are  each  one-dimensional  cdf’s  corresponding  to  mass-point  probability 
functions  hf  x,  h g  y,  respectively,  where 

p(zf;X  =  i)  =  Mi)  =  f«;  p(zf,x  =  o)  =  M°)  =  !-f(x); 

P(Zg;y  =  i)  =  ^g,y(i)  =  g(y);  p(zg,y  =  o)  =  ^g,y(°)  =  !-g(y);  (7) 

whence 

[  0,  if  s  <  0,  [  0,  if  s  <  0, 

Ff,x(s)H  if  0  =S  s  <  1,  Tgy(s)=j  l-g(y),  if  0  s  <1,  all  x,  y  in  D  (8) 

l  1,  if  1  s;  l  1,  if  1  =£  s; 

(ii)  Define  random  sets  S(f,  cop),  S(g,  cop):ft— >P(D),  S(f,  g,  cop): 
ft— >P(D)  xP(D)  as  follows,  for  each  CO  in  ft: 

S(f,  g,  cop)(co)  =  S(f,  cop)(0))x  S(g,  cop)(o})  =  {(x,y):  x,  y  in  D,  ZfiX(co)  Zg)y(©)  =  1); 

S(f,  cop)(co)  =  [x:  x  in  D,  ZfiX(co)  =  1};  S(g,  cop)(co)  =  jy:  y  in  D,  Zg  y(c0)  =  1};  (9) 

whence,  by  straightforward  combinatoric  considerations,  the  entire 
probability  distributions  of  the  marginal  random  subsets  of  D,  S(f,  cop), 

S(g,  cop),  as  well  as  the  joint  random  subset  of  DxD,  S(f,  g,  cop),  are  com¬ 
pletely  determined. 
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(iii)  For  any  x,  y  in  D,  the  following  equality  of  one-point  coverage  events 
hold: 

(x  in  S(f,  cop))  =  (Zf;X  =  1) ;  (y  in  S(g,  cop))  =  (Zg>y=  1);  (10) 

((x,y)  in  S(f,  g,  cop))  =  (x  in  S(f,  cop))  &  (y  in  S(g,  cop))  =  (Zf,x  =  1)  &  (Zg>y=  1).  (11) 

(iv)  For  any  x,  y  in  D,  the  following  one-point  coverage  representations 
for  f,  g  hold: 

P(x  in  S(f,  cop))  =  P(Zfx  =  1)  =  f(x) ;  P(y  in  S(g,  cop))  =  P(Z&y=  1)  =  g(y);  (12) 

P((x  in  S(f,  cop))  &  (y  in  S(g,  cop)))  =  P((Zf>x  =  1)  &  (Z&y=  1)) 

-  1-  P(ZfjX  =  0)  -  P(Zg>y  =  0)  +  P((ZfjX  =  0)  &  (Zg)y=  0)) 

=  1-  P(ZfjX  =  0)  -  P(Zg>y  =  0)  +  P((Zf,x<  0)  &  (Zg,y<  0)) 

=  f-  (i-f(x))  -  (i-g(y))  +  Ff,g,coPX;y(o>  o) 

=  f(x)  +  g(y)  -  1  +  copX;y(l-f(x),  l-g(y)) 

=  f(x)  +  g(y)  -  cocopXiy(f(x),  g(y)) 

=d  copX;yA(f(x),  g(y)),  (13) 

where  we  use  the  relation 

Ff,g,copX;y(0j  0)  =  coPx,y°(Pf,x(0)>  -^g,y(0)) 

=  copx>y°(&f>x(0),  hg  j{ 0)) 

=  copX;y«(l-f(x),  l-g(y)) 

and  where  the  functions  cocop,  copA  are  called  the  cocopula,  survival  cop¬ 
ula,  respectively,  of  cop  (the  latter  apparently  being  the  special  designation 
of  Nelsen  for  modular  transform  [17,  section  2.6]),  where,  for  any  s,  t  in 
[0,1]: 

cocop(s,  t)  =d  1  -  cop(l-s,  1-t)  ;  copA(s,  t)  =<|  s+t  -  cocop(s,t).  (14) 

(v)  Specializing  (iv)  for  x  =  y  in  D  arbitrary, 

P(x  in  S(f,  cop)  n  S(g,  cop))  =  P((x,x)  in  S(f,  g,  cop))  =  copX;yA(f(x),  g(x)).  (15) 

(vi)  As  copula  cop  is  allowed  to  vary  arbitrarily,  the  full  solution  set  of 
distribution-distinct  random  subsets  of  D  that  are  one-point  coverage 
equivalent  to  f,  g,  respectively  in  the  sense  of  Eq.  (12),  is  exhausted.  ■ 

Remark  1.  Note  first  that  cocop  is  the  DeMorgan  transform  of  cop  — so 
that  if  one  thinks  of  cop  as  a  generalized  conjunction  or  "and"  operator— 
as  in  fuzzy  logic  (with  the  usual  desirable  properties  of  being  nondecreas¬ 
ing  in  its  arguments  and  having  appropriate  boundary  properties  when 
one  of  the  arguments  is  0  or  1),  then,  naturally,  cocop  can  be  thought  of  as 
a  general  disjunction  or  "or"  operator.  Nelsen  [17,  section  2],  shows  that 
the  survival  copula  is  always  a  legitimate  copula  and  shows  the  characteri¬ 
zation 

copA  =  cop  iff  cop  is  radially  symmetric,  (16) 

where  the  latter  means  that  the  joint  r.v.  Y  represented  by  cop  is  such  that 
Y  -  (V2,  V2)  and  (V2,  V2)  -  Y  have  the  same  distribution.  In  particular, 
radial  symmetry — and  hence  the  validity  of  Eq.  (16) — holds  for  all 
Gaussian  copulas  'Pp  (*P-1(.),  VP-1(..)),  where  'Pp  is  the  joint  cdf  of 

distribution  Gaussian  (02,  ( 1  Pi)  and  *P  is  the  cdf  of  the  standardized 

\p  1/ 
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one-dimensional  Gaussian  distribution  Gaussian  (0,1)  and  all  of  Frank’s 
Archimedean  copula  family  [17],  [19]  (i.e.,  associative,  commutative  with 
cop(s,s)  <  s,  for  0  <  s  <  1) — which  includes  the  copulas  prod  and  minsum, 
as  well  as  the  special  copula  min,  where  for  all  s,  t  in  [0,1],  min,  prod  are 
the  usual  arithmetic  minimum  and  product  of  s,  t,  respectively,  while  min- 
sum(s,t)  is  given  as 

minsum(s,t)  =  min(s+t-l,  0).  (17) 

■ 

Theorem  2.  Extension  of  Goodman  &  Nguyen  [13] 

Suppose  that  D  is  a  finite  set,  f,  g:D— >[0,1]  are  any  two  fuzzy  set  member¬ 
ship  functions,  cop:  [0,l]DxD— >[0,1]  is  any  copula  with  that  domain,  and 
w:D— >[0,1]  is  a  probability  function.  Define 

((flg)coP,w  =d  2  (  w(x)-copA(f(x),  g(x)))  /  X  (  w(x)-g(x)).  (18) 

xinD  xinD 

Then,  in  the  sense  of  Theorem  1,  there  is  a  probability  space  (£2,B,P)  and 
random  sets  S(f,  cop),  S(g,  cop):  £1  — >P(D),  S(f,g,  cop):  with  the  one-point 
coverage  relations  holding  as  in  Eqs.  (12),  and,  without  loss  of  generality, 
there  exists  a  random  variable  V:£2— >D,  independent  of  S(f,  g,  cop),  and 
hence  of  S(f,  cop),  S(g,  cop),  such  that  the  probability  function  of  V  is  w, 
so  that 

(flg)cop,w  —  cop  I  ^"g,  cop  ),  (19) 

an  ordinary  conditional  probability,  where  events  a£COp  ,  b„COp  in  B  are 
defined  as  the  two-stage  randomization  events 

aU  cop  =d  (V  ln  S(f>  coP))  -  ^g,  cop  =  (v  in  S(g  C0P))  >  (20) 

so  that  in  reduced  form, 

PH  cop  I  ^g,  cop)  =  PH  cop&  ^g,  cop  I  ^g,  cop)  =  P(V  in  S(f-  cop)n  S(g,  cop))  /  P(V  in  S(g  cop)).  (21) 

Proof:  Use  the  usual  conditioning  property  of  probabilities,  independence 
of  V,  and  Eq.  (11)  at  each  outcome  of  r.v.  V, 

P(V  in  S(f,cop)  and  V  in  S(g,  cop))  =  Ev(P(V  in  S(f,cop)  and  V  in  S(g,  cop)  |  V)) 

=  Ev(copA(f(V),  g(V)))  =  X  (  w(x)-copA(f(x),  g(x))).  (22) 

xinD 

Similarly  (and  more  simply),  now  using  Eq.  (12)  in  place  of  Eq.  (13), 

p(V  in  S(g,  cop))  =  Ev(P(V  in  S(g,cop)  |  V))  =  Ev(g(V))  =X  (  w(x)-g(x)).  (23) 

xinD 


The  desired  results  hold  by  dividing  Eq.  (22)  by  Eq.  (23).  ■ 

Remark  2  and  an  Example.  In  Theorem  2,  for  the  special  case  of  w  cor¬ 
responding  to  a  uniform  distribution  over  population  D,  canceling  the 
l/card(D)  factor,  and  usually — but  not  always  choosing  cop  to  be  either 
min  or  prod — the  numerator  of  the  quantity  (f|g)copi  w  reduces  to  the 
popular  fuzzy-logic  concept  of  th e  fuzzy  cardinality  of  f  "and"  g  for  pop¬ 
ulation  D,  i.e.,  to  what  extent  the  entire  population  D  has  characteristics 
described  by  f  "and"  g,  while,  similarly,  the  denominator  represents  the 
fuzzy  cardinality  of  g  (by  itself)  for  population  D.  In  turn,  the  arithmetic 
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division  of  these,  i.e.,  the  quantity  (f|g)COp,w  becomes  the  relative  fuzzy 
cardinality  of  f  "and"  g  for  D  compared  to  fuzzy  cardinality  of  g  for  D, 
i.e.,  the  overall  fuzzy  conditioning  of  f  to  g  with  respect  to  population  D. 

The  latter,  beginning  with  Zadeh’s  ideas  [20,  21],  followed  by  Dubois  & 

Prade’s  modifications  [22],  and  Kosko’s  related  concept  of  fuzzy  subset- 
hood  [23],  are  used  ubiquitously  in  the  fuzzy-logic  community  for  rea¬ 
soning.  In  this  process,  one  considers  the  premise  set  of  a  particular 
linguistic  entailment  of  interest,  the  latter  being  formally  the  same  as  the 
probability-framed  previous  G;  =  [(a|b)j;  (c;|d)],  but  now  where  each  (aj|bj) 
is  replaced  by  a  fuzzy  conditional  — in  its  general  form  the  same  as 
(fj lgj)cop,w  —formed  as  in  Eq.  (18),  now  with  f  replaced  by  fj,  g  by  gj  (for 
possibly  pre-logically  compounded  fuzzy-set  membership  functions),  j  in 
J;  and  with  similar  remarks  applicable  to  the  potential  conclusion  (cj  |d) 
replaced  by  (f0,ilgo)cop,w5  f°r  some  fuzzy  sets  fG ;,  gG,  etc.  But,  Theorem  2 
(with  suitable  modifications,  where  required)  essentially  shows  that  any 
such  (fj|gj)copjW  =  PHcop  |  £gj,coPX  wbh  a  similar  relation  holding  the 
potential  conclusion.  Moreover,  the  variability  of  P  subject  to  whatever 
arbitrary  but  fixed  levels  tj  are  set  for  the  premise  collection  holds  in  the 
same  meaningful  manner  as  in  the  case  where  one  began  the  problem  in  a 
probability  framework,  i.e.,  for  typical  entailment  schemes  of  the  form  G;. 

As  an  application  of  this,  suppose  one  considers  the  transitivity  scheme, 
which  Zadeh  has  also  considered  and  modeled  his  premise  set  as  indicated 
above,  but  has  used  a  method  solely  developed  within  fuzzy  logic  for 
determining  what  the  appropriate  conclusion  should  be  [21].  Thus,  three 
attributes  are  present,  where,  e.g.,  population  D  here  is  the  set  of  all 
enemy  ships  in  area  A,  "ships  with  type  1  weapons  onboard"  corresponds 
to  known  or  estimated  fuzzy-set  membership  function  f  over  D;  "ships 
with  elongated  hulls "  corresponds  to  known  or  estimated  fuzzy-set  mem¬ 
bership  function  g  over  D;  "ships  with  signature  pattern  Q"  correspon¬ 
ding  to  known  or  estimated  fuzzy-set  membership  function  h  over  D. 

Moreover,  other  truth  modifiers  may  be  present,  such  as  "it  is  mostly 
true,"  "it  is  somewhat  true,"  etc.  Here,  for  simplicity,  suppose  for  the 
premise  set,  one  actually  has  "it  is  highly  true  that  the  enemy  ships  in  A 
with  signature  pattern  Q  have  elongated  hulls,"  "it  is  moderately  likely 
that  an  enemy  ship  in  A  with  an  elongated  hull  has  type  1  weapons 
onboard."  Can  one  conclude  "it  is  x-likely  that  an  enemy  ship  in  A  with 
signature  pattern  Q  has  type  1  weapons  onboard,"  where  the  degree  of 
truth  x  is  to  be  determined?  Assume  that  "it  is  highly  true"  is  represented 
by  a  known  or  estimated  fuzzy-set  membership  function  M  over  [0,1], 
which  is  monotone  increasing,  "it  is  moderately  likely"  is  also  represented 
by  a  (different— not  as  steep  toward  1  as  M,  etc.)  known  or  estimated 
fuzzy-set  membership  function  N  over  [0,1],  where  M(r)  =  N(r)  =  r,  for  r 
=  0  or  1.  Hence,  for  any  arbitrary  levels  s,  t  in  [0,1],  the  conditional  fuzzy 
relations  here  are,  for  some  choice  of  copula  and  population  weighting 
function  w, 

M((f|g)cop,w)  =  s ,  N((g|h)cop>w)  =  t  iff  (f|g)cop>w  =  M-I(s) ,  (g|h)cop>w  =  N-I(t) 

iff,  using  Theorem  2,  P {a^  cop  |  £g;Cop)  =  M_1(s), 

P(^g,  cop  I  ch,cop)  =  N-!(t). 


(24) 
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Thus,  for  any  given  levels  s,  t,  one  can  now  consider  the  SOPL-estimate 
of  the  potential  conclusion  for  transitivity,  Pfhf,  cop  |  ^gjCOp)>  with  respect 
to  the  premise  set  above  at  thresholds  s,  t,  where  the  entire  entailment 
scheme  is 
G  = 

cop  I  ^g,cop)’  (^g,  cop  I  ch,coP);  H  cop  I  ch,coP)];  (25) 

meanconc(G)(M_1(s),  N_1(t))  =  EP(P (a^  cop  |  ch;Cop)  I 

PK  cop  I  ^g,cop)  =  M_1(s) ,  P(^g,  cop  I  ch,cop)  =  N_1(t))-  (26) 

In  turn,  Table  1  shows  that  under  a  uniform  distributional  assumption  on 
what  P  could  be,  subject  to  its  constraints  in  the  premise  set  of  G,  for  any 
given  s,  t  in  [1/2,  1] 

meanconc(G)(M_1(s),  N_1(t))  =  p(M_1(s),  N_1(t)), 
where,  for  any  s,  t  in  [V2,  1], 

p(s,t)  =d  st  +  (l-t)/2  -p(s,t)/q(s,t);  p( s,t)  =d  s(l-s)(2s-l)t(l-t2); 

<7(s,t)  =d  t+2t2  +  (s(l-s)(l-t)(2+3t-t2)),  (27) 

where, 

p(s,t)  =  p0(s,t)  =d  st  +  (l-t)/2  ,  for  values  of  s,  t  sufficiently  close  to  1.  (28) 

Hence,  the  posterior  conditional  (given  the  premise  constraints  for  any 
s,  t)  is  approximately  equal  to  p0(M-1(s),  N_1(t)),  which  can  be  interpreted 
also  as  a  truth  modifier  with  respect  to  two  variables,  noting  its  limit  is 
unity  as  s,  t  approach  unity,  etc.  Of  course,  all  of  the  above  applies  to  any 
fuzzy-logic  entailment  scheme  relative  to  the  original  premise  sets  utiliz¬ 
ing  overall  fuzzy  conditioning  for  some  population  D. 

Remark  3.  In  the  same  spirit  of  Theorem  2,  other  fuzzy-logic  concepts 
can  now  be  fully  interpreted.  Due  to  space  limitations,  only  the  example 
of  fuzzy  normalization  will  be  considered  here.  In  this  situation,  a  fuzzy 
membership  function,  say,  f:D— »[0,1]  is  given,  followed  by  its  normaliza¬ 
tion  function  norm(f):D— >[0,1],  which  is  now  obviously  a  legitimate  prob¬ 
ability  function  over  finite  population  D,  where 

norm(f)  =  (l/X(f(x))  )’f  .  (29) 

xinD 

But,  if  one  considers,  a  la  Theorem  1,  for  any  choice  of  copula  cop,  a 
probability  space  (£2,i>,P),  for  which,  without  loss  of  generality,  there  is 
both  a  random  set  S(f,  cop):£2— >P(D)  and  an  independent  random  variable 
V:D— >D  uniformly  distributed  over  D,  with  the  one-point  coverage  rela¬ 
tion  holding 

P(x  in  S(f,  cop))  =  f(x),  all  x  in  D,  (30) 

for  any  x  in  D,  specializing  Eq.  (23)  with  g  replaced  by  f, 

P(V  =  x  |  V  in  S(f,  cop))  =  P(V  =  x  and  x  in  S(f,  cop))  /  P(V  in  S(f,  cop)) 

=  ((l/card(D))-f(x))  (l/card(D))-g(x)}  =  norm(f)(x), 

xinD 


(31) 
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showing  fuzzy  normalization  is  actually  a  simple  conditional  probability 
restriction  of  the  two-stage  randomization  for  one-point  coverages.  A 
future  paper  will  deal  with  related  issues. 
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